Improving the Performance of CP2K on the Cray XT

نویسنده

  • Iain Bethune
چکیده

CP2K is a freely available and increasingly popular Density Functional Theory code for the simulation of a wide range of systems. It is heavily used on many Cray XT systems, including ‘HECToR’ in the UK and ‘Monte Rosa’ in Switzerland. We describe performance optimisations made to the code in several key areas, including 3D Fourier Transforms, and present the implementation of a load balancing scheme for multi-grids. These result in performance gains of around 30% on 256 cores (for a generally representative benchmark) and up to 300% on 1024 cores (for non-homogeneous systems). Early results from the implementation of hybrid MPI/OpenMP parallelism in the code are also presented.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

CP2K Performance from Cray XT3 to XC30

CP2K is a powerful open-source program for atomistic simulation using a range of methods including Classical potentials, Density Functional Theory based on the Gaussian and Plane Waves approach, and post-DFT methods. CP2K has been designed and optimised for large parallel HPC systems, including a mixed-mode MPI/OpenMP parallelisation, as well as CUDA kernels for particular types of calculations...

متن کامل

Improving the scalability of CP2K on multi-core systems A dCSE Project

Six months of HECToR dCSE funding was given to implement mixed-mode OpenMP parallelism in CP2K, building on the results of an earlier successful dCSE project. Improved scalability of up to 8 times as many cores was demonstrated for a small benchmark, and a larger, inhomogeneous benchmark was shown to scale up to 9000+ cores. An increase in peak performance of up to 60% was also realised on HECT...

متن کامل

The Cray XT Compilers

The Cray XT3 and Cray XT4 supports compilers from the Portland Group, PathScale, and the GNU Compiler Collection. The goal of the paper is to provide Cray XT users with an overview of the different compilers available in the Cray XT Programming Environment. Discussion will highlight the feature and performance differences between the compilers and provide guidelines in determining which compiler

متن کامل

Comparison of Cray XT 3 and XT 4 Scalability ∗

The Cray XT3 and XT4 have similar architectures, differing primarily in memory performance and in bandwidth between the node and interconnect. This paper evaluates and compares the scalability of the XT3 and XT4. Kernel benchmarks are used to verify and to quantify the performance differences between the systems. Application benchmarks are used to examine the impact of these differences on scal...

متن کامل

Scalable performance analysis of large-scale parallel applications on Cray XT systems with Scalasca

The open-source Scalasca toolset (available from www.scalasca.org) supports integrated runtime summarization and automated trace analysis on a diverse range of HPC computer systems. An HPC-Europa2 visit to EPCC in 2009 resulted in significantly enhanced support for Cray XT systems, particularly the auxilliary programming environments and hybrid OpenMP/MPI. Combined with its previously demonstra...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2010